WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




PCT 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 7 
G01N 31/00, G06F 19/00 



A1



(11) International Publication Number: WO 00/50889 

(43) International Publication Date: 31 August 2000 (31.08.00)



(21) International Application Number: PCT/US00/04338 

(22) International Filing Date: 22 February 2000 (22.02.00) 



(30) Priority Data: 

60/121,432 



23 February 1999 (23.02.99) US 



(63) Related by Continuation (CON) or Continuation-in-Part 
(CIP) to Earlier Application 

US 60/121,432 (CIP) 

Filed on 23 February 1999 (23.02.99) 



(71) Applicant (for all designated States except US):

WARNER-LAMBERT COMPANY [US/US]; 2800
Plymouth Road, Ann Arbor, MI 48105 (US).

(72) Inventor; and 

(75) Inventor/Applicant (for US only): ROGERS, John, C. 
[US/US]; 11615 Pleasant Lake Road, Manchester, MI 48158
(US). 

(74) Agents: KOKULIS, Paul, N. et al.; Pillsbury Madison & Sutro, 
LLP, 1100 New York Avenue, N.W., Washington, DC 
20005 (US). 



(81) Designated States: AE, AL, AM, AT, AU, AZ, BA, BB, BG, 
BR, BY, CA, CH, CN, CR, CU, CZ, DE, DK, DM, EE, 
ES, FI, GB, GD, GE, GH, GM, HR, HU, ID, IL, IN, IS, JP, 
KE, KG, KP, KR, KZ, LC, LK, LR, LS, LT, LU, LV, MA, 
MD, MG, MK, MN, MW, MX, NO, NZ, PL, PT, RO, RU,
SD, SE, SG, SI, SK, SL, TJ, TM, TR, TT, TZ, UA, UG, 
US, UZ, VN, YU, ZA, ZW, ARIPO patent (GH, GM, KE, 
LS, MW, SD, SL, SZ, TZ, UG, ZW), Eurasian patent (AM, 
AZ, BY, KG, KZ, MD, RU, TJ, TM), European patent (AT, 
BE, CH, CY, DE, DK, ES, FI, FR, GB, GR, IE, IT, LU, 
MC, NL, PT, SE), OAPI patent (BF, BJ, CF, CG, CI, CM, 
GA, GN, GW, ML, MR, NE, SN, TD, TG). 



Published 

With international search report. 

Before the expiration of the time limit for amending the 
claims and to be republished in the event of the receipt of 
amendments. 



(54) Title: SYSTEM AND METHOD FOR MANAGING AND PRESENTING INFORMATION DERIVED FROM GENE EXPRESSION 
PROFILING 



(57) Abstract 

A data processing system and method for managing and presenting information derived from a differential expression of genetic
information which can be used to model a physiological response of biological cells. A data-represented map of metabolic pathways is
provided. The map has a matrix of regions and locations within each of the regions. The regions are defined by respective coordinate sets.
Each of the metabolic pathways includes, at certain ones of the locations, respective graphical representations of a reaction, a substrate and
a product of the reaction, and at least one biological catalyst of the reaction. An association mechanism links previously generated sets
of affected sequence data to biological catalyst identifiers, each identifying one or more unique biological catalysts, and links the
biological catalyst identifiers to the coordinate sets. The sets of affected sequence data represent a direction and a magnitude of regulation
of each one of a number of different nucleic acid sequences.




FOR THE PURPOSES OF INFORMATION ONLY 
Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCT. 



AL  Albania
AM  Armenia
AT  Austria
AU  Australia
AZ  Azerbaijan
BA  Bosnia and Herzegovina
BB  Barbados
BE  Belgium
BF  Burkina Faso
BG  Bulgaria
BJ  Benin
BR  Brazil
BY  Belarus
CA  Canada
CF  Central African Republic
CG  Congo
CH  Switzerland
CI  Côte d'Ivoire
CM  Cameroon
CN  China
CU  Cuba
CZ  Czech Republic
DE  Germany
DK  Denmark
EE  Estonia
ES  Spain
FI  Finland
FR  France
GA  Gabon
GB  United Kingdom
GE  Georgia
GH  Ghana
GN  Guinea
GR  Greece
HU  Hungary
IE  Ireland
IL  Israel
IS  Iceland
IT  Italy
JP  Japan
KE  Kenya
KG  Kyrgyzstan
KP  Democratic People's Republic of Korea
KR  Republic of Korea
KZ  Kazakstan
LC  Saint Lucia
LI  Liechtenstein
LK  Sri Lanka
LR  Liberia
LS  Lesotho
LT  Lithuania
LU  Luxembourg
LV  Latvia
MC  Monaco
MD  Republic of Moldova
MG  Madagascar
MK  The former Yugoslav Republic of Macedonia
ML  Mali
MN  Mongolia
MR  Mauritania
MW  Malawi
MX  Mexico
NE  Niger
NL  Netherlands
NO  Norway
NZ  New Zealand
PL  Poland
PT  Portugal
RO  Romania
RU  Russian Federation
SD  Sudan
SE  Sweden
SG  Singapore
SI  Slovenia
SK  Slovakia
SN  Senegal
SZ  Swaziland
TD  Chad
TG  Togo
TJ  Tajikistan
TM  Turkmenistan
TR  Turkey
TT  Trinidad and Tobago
UA  Ukraine
UG  Uganda
US  United States of America
UZ  Uzbekistan
VN  Viet Nam
YU  Yugoslavia
ZW  Zimbabwe





SYSTEM AND METHOD FOR MANAGING AND PRESENTING INFORMATION 
DERIVED FROM GENE EXPRESSION PROFILING 

BACKGROUND OF THE INVENTION 

1. Copyright Notice. 

Certain portions of this patent document may be subject to copyright protection.

While the facsimile reproduction by anyone of this patent document, as it appears in the U.S. 
Patent and Trademark Office patent files or records, is permitted, no other use or 
reproduction is permitted, and the copyright owner reserves all copyright rights whatsoever. 

2. Field of the Invention. 

The present invention is directed to certain systems and methods for managing and
presenting information derived from techniques for monitoring differential expression of
nucleic acid sequences, e.g., gene expression profiling.

3. Description of Background Information.

Gene expression profiling processes are commonly used to represent a cell's
physiological response to a particular compound, treatment, or disease. For example, a
January 1, 1999 article by Iyer et al., Volume 283, Science, at pages 83-87
(www.sciencemag.org) discloses the use of a temporal program of gene expression to
represent a physiological response of human cells to a treatment, particularly the response
of fibroblasts to serum. A cDNA microarray was used, representing over 8,600 distinct
human genes. Fibroblasts, cultured from human neonatal foreskin, were placed in a quiescent
state by depriving the cells of serum for 48 hours. The fibroblasts were then stimulated by
adding a medium containing 10% FBS, and the microarray was then used to measure the
levels of 8,613 different mRNA sequences at 12 distinct times. The microarray was used to
identify those genes (including expressed sequence tags, or ESTs) which were substantially
repressed or induced and the extent of repression or induction (i.e., fold change). Five
hundred seventeen genes whose mRNA levels changed in response to the treatment were
selected, and graphically depicted in accordance with a hierarchy.

From this information, various proteins could be identified, which were categorized
according to their biological functions. Those biological function categories identified were
signal transduction, immediate-early transcription factors, other transcription factors, cell
cycle and proliferation, coagulation and hemostasis, inflammation, angiogenesis, tissue
remodeling, cytoskeletal reorganization, re-epithelialization, cholesterol biosynthesis, and an
unidentified role in wound healing.

Various technologies are available for expressing large numbers of genes. A small
sample of the available implementations incorporating those technologies includes SAGE
(serial analysis of gene expression), oligo arrays, and cDNA arrays.

Those technologies produce data identifying large numbers of expressed genes, and
the extent of their repression or induction. To aid in the analysis of these large sets of data,
biological computational analysis systems are being developed. An approach typically used
to create control and treatment probes comprising respective arrays is one used by Iyer et al.,
in which the data from such arrays is presented in the form of a two-dimensional cluster
image showing the dispersion of gene clusters that are either up or down regulated (induced
or repressed).

Databases and wall charts have been provided which facilitate the study of treatment
data. For example, the Boehringer Mannheim biochemical pathways wall chart and the Cell
Signaling Pathways Chart, distributed by Zymed Laboratories, graphically illustrate select
metabolic pathways existing in nature, the interrelationships between various of the
illustrated metabolic pathways (such as connections between the metabolic pathways, and
branching points of substrate metabolism), and factors controlling the direction and the speed
of turnover from one point to another within a given metabolic pathway.

There is a need for a system which will better facilitate the analysis of data obtained
from expression profiling techniques, to more readily identify key metabolic pathway
information, mechanisms of action, mechanisms of drug inactivation and clearance, and
potential side effects. Such a system will preferably also provide meaningful information that
assists with the identification of the physiological effects of certain treatments and the
biological function associated with the affected metabolic activity.

4. Definitions

For purposes of clarification, and to assist readers in an understanding of the present 
invention, and the embodiments disclosed herein, a number of terms used herein are defined 
as follows: 
Biological function:

An inferred functional classification of a given gene, protein, nucleic acid sequence, or
pathway. Some examples of biological functions are metabolism, angiogenesis, signal
transduction, transcription factors, cell cycle control, regulation of proliferation, coagulation
and hemostasis, inflammation, and apoptosis.

Enzyme:

Protein that catalyzes biochemical reactions.

Protein molecule:

One or several polypeptide chains of amino acids.

Expression profiling:

A process by which gene expression techniques are used to measure and compare
levels of certain nucleic acid sequences (e.g., mRNAs, proteins, genes, ESTs) in a
cell-derived sample in relation to the levels of the same nucleic acid sequences from a different
sample or from the same sample at a different time.

Gene:

A sequence of nucleotides specifying a particular polypeptide chain.

Metabolic pathway:

Any individual biological reaction involving a substrate and a product caused by a
reaction, as well as the catalyst of such reaction. Catalysts of reactions in metabolic
pathways are typically enzymatic. A metabolic pathway also includes any related series of
such individual reactions.

Mechanism of action: 

A causal link between a variant and a response to the variant, for example, identifying
the specific metabolic pathway or biological function on which, or the point within an
individual pathway or function at which, a compound or treatment acts to produce a given
physiological effect. For example, if blood pressure is reduced, the mechanisms of action
comprise the specific metabolic pathways and biological functions that are being acted upon
or involved with the reduction of blood pressure.

mRNA (messenger RNA):

An RNA molecule synthesized from a DNA template by the enzyme RNA
polymerase. An mRNA functions as a template for the assembly of a polypeptide chain, a
process known as translation.

Physiological effect:

Some physiological change or response. A physiological effect could be a state of a
given biological system (activation or deactivation), for example a change in high blood
pressure.

RNA: 

Ribonucleic acid. 
RNA Polymerase: 

An enzyme that synthesizes RNA by using DNA as a template. 
Transcription: 

A process by which an RNA molecule is synthesized by the enzyme RNA polymerase 
using DNA as a template. 

SUMMARY OF THE INVENTION 

In view of the above, the present invention, through one or more of its various aspects 
and/or embodiments, is thus presented to accomplish one or more objects and advantages 
such as those noted below. 

An object of the present invention is to provide an improved mechanism for 
facilitating the display of meaningful information based upon expression profiling, such 
information facilitating the determination of biological functions involved with treatments, 
compounds, or diseases, the identification of metabolic pathways, and the identification of 
mechanisms of action. A further object of the present invention is to provide a structure for 
organizing and displaying information to enable data mining, whereby expression profile data 
is grouped in accordance with certain metabolic pathway characteristics in a displayed map. 

The present invention, therefore, is directed to a system or method, or one or more
components thereof, for managing and presenting information derived from differential
expression of genetic information which can be used to model a physiological response of
biological cells. The system comprises an expression profiling subsystem. The expression
profiling subsystem generates, from control and treatment sets of cell-derived samples,
respective sets of sequence data representing a direction and a magnitude of regulation of
each one of a high number of different nucleic acid sequences. Sets of nucleic acid
sequences are associated with particular regions on a map of metabolic pathways of the
biological organism being studied. An overview of the map coordinates may be provided,
and those areas or regions of the map comprising high concentrations of affected nucleic acid
sequences may be differentiated from other regions of the map, for example, by having a
different color. Regions of the map with high concentrations of the affected nucleic acid
sequences may be viewed in further detail, to view the specific metabolic pathways involved,
and the role the affected nucleic acid sequences play within such metabolic pathways.

Alternatively or in addition, an overview may be provided of the map which identifies
specific affected nucleic acid sequences within a given set of metabolic pathways. Such
indications include a first symbol representing a point of inhibition within the set of
pathways, second symbols representing biological catalyst locations within the set of
pathways, and third symbols representing locations of end products of the illustrated set of
metabolic pathways.

BRIEF DESCRIPTION OF THE DRAWINGS

The present invention is further described in the detailed description which follows,
by reference to a noted plurality of drawings, by way of non-limiting exemplary
embodiments of the present invention, in which like reference numerals represent similar
parts throughout the several views of the drawings and wherein:

Fig. 1 is a block diagram of a gene expression profiling data analysis system; 
Fig. 2 is a flow diagram of a gene expression profiling process; 
Fig. 3 is a flow diagram of a process for managing information derived from gene 
expression profiling; 

Fig. 4 is an overview representation of a biochemical pathway map, indicating the
concentrations of affected nucleic acid sequences at certain coordinates within the map;

Fig. 5 is a more detailed blown-up view of certain cells within a given area of the 
biochemical pathway map; 

Fig. 6 shows a given set of biosynthetic pathways affected by feedback inhibition; 
Fig. 7 is a diagram of a database structure in accordance with the illustrated
embodiment;

Fig. 8 is a flow chart representing a process performed by the client computer to
match expression profiling data up with mapped metabolic pathways;

Fig. 9 is an example of an overview display of the metabolic pathway map in which
related repressed and induced biological catalysts, a point of inhibition and end products are
represented by symbols; and

Fig. 10 is a flow diagram of a process of identifying BCIs within affected pathways 
with a simplified set of symbols on the overview map display. 



DETAILED DESCRIPTION OF THE EXEMPLARY EMBODIMENTS 
Referring now to the drawings in greater detail, Fig. 1 shows an analysis system 10
according to the illustrated embodiment of the present invention. An expression profiling
subsystem 12 is provided, which is coupled to a client computer 14. Client computer 14
comprises, among other elements, a browser application 16, a human interface 18, and a
display 20. Human interface 18 may comprise any standard or other interface for facilitating
human interaction with and control of client computer 14, including, for example, a keyboard
and a mouse. Client computer 14 is coupled to a host computer 24 via a network connection
illustrated in Fig. 1 as an intranet. Host computer 24 is connected to a database 26.

Expression profiling subsystem 12 may comprise, for example, an Affymetrix cDNA
array. It generates, from control and treatment sets of cell-derived samples, respective sets of
sequence data representing a direction and a magnitude of regulation of each one of a high
number of different nucleic acid sequences.

Client computer 14, together with human interface 18, display 20, and browser
application 16, allows a user to operate analysis system 10. Client computer 14
communicates with database 26 through intranet 22 and host computer 24. Expression
profiling subsystem 12 obtains the expression profiling data and stores that data in an
organized fashion on database 26.

Host computer 24 is provided with, among other elements, an analysis application 27
for carrying out certain analysis process steps associated with expression profiling and
managing the data acquired from the expression profiling. A database server software
component 28 is provided for handling and acting on database queries and responses.

Fig. 2 generally shows an expression profiling process in accordance with the
illustrated embodiment. In an initial step S2, sequences are generated based upon a baseline
sample (otherwise referred to as a control sample) of cells. In a step S6, one or more
differentiated sequences may be generated based upon treated samples, i.e., samples of cells
based upon those cells entering into a diseased state or being treated with a particular
compound. After performing each of respective steps S2 and S6, a cluster algorithm S4 and
S8 is performed, in which similar sequences, including expressed sequence tags (ESTs), are
grouped together. Clustering of gene sequence pieces allows redundancies to be eliminated,
as a gene expression array will typically identify not only full gene sequences or full mRNA,
but will also identify ESTs, which comprise shorter pieces of the full sequence. The total
number of sequence pieces within a given cluster may be considered to represent the total
number of genes repressed or induced having a particular sequence.

An alternative method of clustering is to use the expression data to cluster by
expression patterns, i.e., similar profiles over a course of time. This approach would allow
comparison between genes having known functions with genes having unknown functions to
assist in identifying the unknown functions, such as is done by Iyer et al. in the
above-identified article.
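
By way of illustration only, clustering by expression pattern can be sketched as follows. This is a minimal sketch and not the method of the above-identified article: it assumes each gene or EST has a time-course profile of equal length, and it groups profiles greedily by Pearson correlation against a seed profile. The function names, the gene labels and the 0.9 threshold are hypothetical.

```python
from math import sqrt


def pearson(a, b):
    """Pearson correlation of two equal-length time-course profiles."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return 0.0  # a flat profile has no pattern to compare
    return cov / sqrt(var_a * var_b)


def cluster_by_pattern(profiles, threshold=0.9):
    """Greedy grouping: a gene joins the first cluster whose seed it matches."""
    clusters = []  # list of (seed_profile, member_names)
    for gene, profile in profiles.items():
        for seed, members in clusters:
            if pearson(seed, profile) >= threshold:
                members.append(gene)
                break
        else:
            clusters.append((profile, [gene]))
    return [members for _, members in clusters]


# Hypothetical profiles measured at four time points.
profiles = {
    "known_gene_A": [1.0, 2.0, 4.0, 8.0],
    "unknown_EST_1": [1.1, 2.1, 3.9, 8.2],
    "known_gene_B": [8.0, 4.0, 2.0, 1.0],
}
groups = cluster_by_pattern(profiles)
```

In this sketch the unknown EST lands in the same group as a gene of known function, which is the kind of comparison the paragraph above describes.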

In order to determine whether the gene clusters have been substantially affected (i.e.,
either repressed or induced), the number of genes generated in the baseline sample is
compared with the number of genes generated and clustered in each cluster in the treated
sample or samples, to produce, for each treated sample, an indication of whether the gene
cluster was regulated and the extent and direction of that regulation.
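
The comparison just described can be sketched, by way of example, as a fold-change calculation. This is a minimal sketch under stated assumptions: the function name is hypothetical, the counts are assumed to be positive, and the default twofold cutoff follows the threshold mentioned in connection with step S24.

```python
def regulation(baseline_count, treated_count, min_fold=2.0):
    """Return (direction, fold) for one cluster.

    direction is "induced", "repressed", or "unchanged" when the change
    is smaller than min_fold.
    """
    if baseline_count <= 0 or treated_count <= 0:
        raise ValueError("counts must be positive to form a ratio")
    ratio = treated_count / baseline_count
    fold = ratio if ratio >= 1 else 1 / ratio  # magnitude, always >= 1
    if fold < min_fold:
        return ("unchanged", fold)
    return ("induced" if ratio > 1 else "repressed", fold)
```

For instance, a cluster counted 10 times in the baseline sample and 40 times in a treated sample would be reported as induced fourfold.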

More specifically, by way of example, a sample of cells may be sequenced using an
expression profiling array, such as an Affymetrix GeneChip™ probe array for, e.g.,
the human genome, which is capable of detecting over 6,000 sequences for that genome.
Affymetrix provides a GeneChip™ fluidics station which automates the hybridization of
nucleic acid targets to a probe array cartridge, and thus controls the delivery of reagents and
the timing and temperature for hybridization. Each fluidics station can independently process
four probe arrays at a given time.

Accordingly, each target may be prepared from a set of cell dishes by isolation of
RNA over a course of time. The treatment of those cells may be emulated by adding, for
example, serum thereto. At predetermined intervals, a small amount of the fluid is removed,
and the cells are put in a quiescent state to stop the reaction time. Accordingly, a large set of
targets, having a predetermined amount of liquid (e.g., 0.5 ml each) is produced. The
GeneChip™ fluidics station will then automatically hybridize each target, i.e., it will extract
all the RNA and label the RNA by adding a chemical tag to each molecule, and control the
delivery of the resulting liquid to the probe arrays to facilitate the obtaining of sequencing
information regarding the mRNAs. This is done by the probe arrays exposing the target to
light at a predetermined location and measuring the photons collected at various locations
within the arrays. The amount of mRNA (or an EST) is then ascertained based upon the
signal strength of the reading given by the probe at the appropriate location corresponding to
that sequence or sequence segment.

Fig. 3 is a flowchart of an analysis process performed by the illustrated embodiment.
In a first step S20, gene expression profiling is performed, at which time respective sets of
sequence data are generated from control and treatment sets of cell-derived samples/targets,
and the obtained data includes information regarding the direction and magnitude of
regulation of each one of a high number of different nucleic acid sequence clusters. Once
gene expression profiling is performed at step S20, a set of data D2 is produced which
comprises the identified sequences and associated regulation information. Then, at step S22,
each sequence cluster is matched to a biological catalyst identifier (BCI). In the illustrated
embodiment, the BCI may comprise, for example, an EC number. EC numbers are part of a
known system for enzyme classification. Each EC number comprises a first number which
refers to one of six main subdivisions, a second number which indicates a subclass, a third
number indicating a sub-subclass, and a fourth number which represents a serial number.

The major EC classes include (1) oxidoreductases: redox reactions; (2) transferases: transfer
of a group (CH3); (3) hydrolases: cleavage, H2O; (4) lyases: cleavage by elimination; (5)
isomerases: geometric changes; and (6) ligases: coupled to ATP hydrolysis. As an
example of some subclasses, the oxidoreductases are as follows:

1. Oxidoreductases
  1.1 CHOH donors
    1.1.1 NAD+ or NADP+ acceptor
    1.1.2 Cytochrome acceptor
    1.1.3 Oxygen acceptor
    1.1.5 Quinone acceptor
    1.1.99 Other acceptor

At step S24, each cluster of affected sequences, i.e., sequences that have been
significantly regulated (by at least twofold), is categorized in accordance with its cluster,
whether it was up or down regulated (i.e., induced or repressed, respectively), and the extent
of regulation. Further, at step S26, the number of regulated sequences or sequence segments
(ESTs) falling within a given cell of contour plot 30 is summed and binned in association
with that cell.
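
Steps S22 through S26 can be sketched, by way of example, as follows. This is a minimal sketch under stated assumptions: the two lookup tables, the cluster names and the cell assignments are hypothetical placeholders for associations that the described system would hold in its database, and the function names are illustrative only.

```python
from collections import Counter

# Hypothetical associations; real ones would come from the database.
CLUSTER_TO_EC = {
    "cluster_a": "1.1.1.27",
    "cluster_b": "2.7.1.1",
    "cluster_c": "1.1.1.27",
}
EC_TO_CELL = {"1.1.1.27": ("X7", "Y7"), "2.7.1.1": ("X8", "Y7")}


def parse_ec(ec):
    """Split an EC number into (main class, subclass, sub-subclass, serial)."""
    parts = ec.split(".")
    if len(parts) != 4:
        raise ValueError(f"expected four dot-separated fields: {ec!r}")
    main, subclass, subsubclass, serial = (int(p) for p in parts)
    if not 1 <= main <= 6:  # six main subdivisions, as described in the text
        raise ValueError(f"main class must be 1-6: {ec!r}")
    return main, subclass, subsubclass, serial


def bin_affected(regulated, min_fold=2.0):
    """Count affected clusters per map cell.

    regulated: iterable of (cluster_name, fold_change) pairs.
    """
    counts = Counter()
    for name, fold in regulated:
        if fold < min_fold:
            continue  # not significantly regulated
        ec = CLUSTER_TO_EC.get(name)
        if ec is None:
            continue  # no biological catalyst identifier matched (step S22)
        parse_ec(ec)  # reject malformed identifiers early
        cell = EC_TO_CELL.get(ec)
        if cell is not None:
            counts[cell] += 1  # sum and bin per cell (step S26)
    return counts


counts = bin_affected([("cluster_a", 3.0), ("cluster_b", 1.5), ("cluster_c", 2.0)])
```

Here the cluster regulated only 1.5-fold is dropped, and the two remaining clusters share an identifier and therefore fall in the same cell.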

At step S28, a summed section of a detailed map view is displayed, which includes
metabolic pathways corresponding to substantially affected sequences. Fig. 4 illustrates a
contour plot view 30 of a biochemical pathway map, which illustrates and helps clarify the
acts performed in steps S24 and S26 of the process illustrated in Fig. 3.

More specifically, Fig. 4 is a representation of a contour plot view of a biochemical
pathway map. In the illustrated embodiment, the map corresponds to the biochemical
pathways wall chart by Boehringer Mannheim. The map may comprise graphic
representations of biochemical pathways which are identical or comparable to the Boehringer
Mannheim wall chart, or any other appropriate set of graphical representations of
biochemical pathways, where a given pathway, or point within a pathway, is associated with
a particular set of coordinates within the map. In the illustrated embodiment, a matrix of cells
is provided, comprising fourteen columns along the X direction (X1-X14) and eight rows
along the Y direction (Y1-Y8). The contour plot view shown in Fig. 4 shows whether the
number of sequences having an EC number within a given cell is within one of six
prescribed ranges. Each range is depicted by a different pattern; the ranges are 0, 1, 2-3, 4-5,
6-7, and 8-10. By way of example, the cell at coordinates X8, Y7 has one sequence having
an EC number falling within that cell. The cell at X7, Y7 has seven sequences with an EC
number falling within that cell. Accordingly, the cell X8, Y7 is illustrated as falling within
the second range, 1, and the cell at X7, Y7 is shown as having a number of sequences falling
within the range 6-7.
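
The assignment of a cell's count to one of the listed ranges can be sketched as a simple lookup. This is a minimal sketch; the function name is hypothetical, and counts above 10 are treated as an error only because the text lists no range beyond 8-10.

```python
# Inclusive (low, high) bounds of the ranges listed above.
RANGES = [(0, 0), (1, 1), (2, 3), (4, 5), (6, 7), (8, 10)]


def range_label(count):
    """Return the label of the range that a per-cell count falls in."""
    for low, high in RANGES:
        if low <= count <= high:
            return str(low) if low == high else f"{low}-{high}"
    raise ValueError(f"count {count} is outside the listed ranges")
```

With these bounds, the cell at X8, Y7 (one sequence) is labeled "1" and the cell at X7, Y7 (seven sequences) is labeled "6-7", matching the example in the text.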

While patterns are shown in Fig. 4 in order to differentiate between different ranges of
sequences having an EC number falling within a given cell, it is preferred that the ranges be
depicted with the use of a coloring scheme. By way of example, the range 1 could be
represented by the color purple, while the range 2-3 is represented by the color green, the
range 4-5 is represented by the color yellow, the range 6-7 is represented by the color orange,
and the range 8-10 is represented by the color red.

The view provided by the contour plot shown in Fig. 4 can thus provide a quick
overall view of the activity throughout the various areas of the pathway map, and those areas
having yellow, orange and red colors indicate those areas with the most activity.
Accordingly, one can select areas in accordance with the amount of activity to view a more
detailed view of the map.
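
The example coloring scheme, and the selection of the most active areas, can be sketched as follows. This is a minimal sketch; the names are hypothetical, and leaving the range 0 without a color is an assumption, since the text assigns no color to it.

```python
# Example colors from the text, keyed by range label.
RANGE_COLORS = {
    "1": "purple",
    "2-3": "green",
    "4-5": "yellow",
    "6-7": "orange",
    "8-10": "red",
}
# Colors the text associates with the most activity.
HOT_COLORS = {"yellow", "orange", "red"}


def most_active(cell_ranges):
    """Return the cells whose range is drawn in yellow, orange or red.

    cell_ranges maps a cell such as ("X7", "Y7") to its range label.
    """
    return sorted(
        cell
        for cell, label in cell_ranges.items()
        if RANGE_COLORS.get(label) in HOT_COLORS
    )


hot = most_active({("X7", "Y7"): "6-7", ("X8", "Y7"): "1", ("X2", "Y3"): "0"})
```

A user interface could offer the returned cells as the candidates for the more detailed view.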

Fig. 5 shows a small portion of a biochemical pathway map which illustrates various
aspects of certain biochemical pathways at prescribed coordinates x_{n-1}, x_n, and x_{n+1} along the
x direction, and y_{m-1}, y_m, and y_{m+1} along the y direction. The map comprises graphical
representations of metabolic pathways. Those graphical representations comprise individual
graphic representations of such items as substrates, products, biological catalysts (BCs),
inhibitors, biological functions, and pathway directions (including unique graphical
identifiers showing a direction of a pathway in one direction versus the opposite direction,
and an amphibolic pathway direction which indicates that the reaction can go in either
direction).

More specifically, as shown in Fig. 5, a plurality of pathway direction symbols 40a-40d
are provided in the section of the map shown in Fig. 5. The use of an arrow at each end
of the illustrated lines 40b and 40c indicates that the pathway direction is amphibolic. A
plurality of substrate/product symbols 42a-42c are provided which represent
substrate/products 2 and 3. Those symbols may comprise, for example, text identifying a
given compound which may serve as either a substrate or a product, depending upon the
direction of the chemical reaction. Each biological catalyst or set of biological catalysts
associated with the particular pathway, including biological catalyst(s) 1 and biological
catalyst(s) 2 in the illustrated embodiment, is illustrated with a respective biological catalyst
symbol 44a, 44b adjacent to the pathway direction symbol.

A block is provided for indicating a biological catalyst symbol 44a and 44b. These
symbols may simply comprise a textual representation of the common nomenclature for the
given biological catalyst, which typically will comprise an enzyme in the case of metabolic
pathways. A BCI (biological catalyst identifier) symbol 46a, 46b is provided adjacent its
respective biological catalyst symbol 44a, 44b, and in the illustrated embodiment simply
comprises a numerical representation of the BCI. Any inhibitors will be represented with
inhibitor symbols 48a, 48b, which, in the illustrated embodiment, may simply comprise text
representing the inhibitor using standard nomenclature.

The biological function with which the metabolic pathways in a certain region of the
map are associated may be represented with a biological function symbol 50, which, in the
illustrated embodiment, comprises a text representation of the biological function using
common nomenclature. Some example biological functions include fatty acid oxidation,
carotenoids, and ketone bodies. Other functions include, for example, sulphur metabolism
and pterine biosynthesis.

In the illustrated embodiment, one or more of the graphic representations may have a
unique color to identify the type of information it is representing. For example, the text
serving as BCI symbols 46a, 46b may be in green, the text serving as the biological catalyst
symbols 44a, 44b may be magenta or aqua, the text serving as the inhibitor symbols 48a, 48b
may be the color brown, and the text serving as the biological function symbol 50 may be the
color blue. Additional or alternative coloring schemes may be used. Also, unique graphical
patterns may be used in addition to or instead of colors to facilitate the viewer's ready
identification or classification of a particular symbol as representing one type of information
versus another. Each enzyme shown in Fig. 5 may take one of two colors, one if it is induced
(up regulated), and another if it is repressed (down regulated). Accordingly, in the illustrated
embodiment, biological catalysts 44a and 44b are magenta and aqua, respectively, indicating
that biological catalyst(s) 1 44a was induced (up regulated), while biological catalyst(s) 2 44b
was repressed (down regulated).
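
The coloring of a biological catalyst symbol by its regulation can be sketched, by way of example, as a small record type. This is a minimal sketch; the class and field names are hypothetical, and the black default for an unaffected catalyst is an assumption, since the text names colors only for the induced and repressed cases.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class CatalystSymbol:
    """Text symbol for one biological catalyst on the detailed map."""

    name: str                         # common nomenclature, e.g. an enzyme name
    bci: str                          # biological catalyst identifier (EC number)
    regulation: Optional[str] = None  # "induced", "repressed", or None

    def color(self):
        """Magenta if induced, aqua if repressed, else an assumed default."""
        return {"induced": "magenta", "repressed": "aqua"}.get(
            self.regulation, "black"
        )


symbol_44a = CatalystSymbol("biological catalyst 1", "1.1.1.27", "induced")
symbol_44b = CatalystSymbol("biological catalyst 2", "2.7.1.1", "repressed")
```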

By mapping sequences obtained from expression profiling techniques to specific
symbols within a metabolic pathway map, such as shown in Fig. 5, the information provided
by the expression profiling data can be quickly related to meaningful pieces of information
relevant to key concerns associated with the treatment, disease, or compound being applied to
the tested cells. The visualization of the results of the expression profiling experiment is
enabled by identifying such valuable pieces of information as biological function
(represented by a biological function symbol 50), metabolic pathway (represented by a set of
graphical representations forming a given metabolic pathway at specific coordinates within
the map), and a mechanism of action (the identification of which will be more fully described
by the use of an example below).

This can have significant benefits in the evaluation of treatments and compounds, for
example, allowing the identification of mechanisms of action, mechanisms of drug
inactivation and clearance, and potential side effects.


Fig. 6 is an illustration of a select group of related pathways. The related pathways
shown in Fig. 6 may correspond, for example, to a number of identified biological catalysts
on the map as depicted in the "big picture" view provided in Fig. 9, which will be described
further below.

Fig. 6 shows a composite pathway comprising a plurality of pathways (pathway1 -
pathway9). Each illustrated pathway (pathway1 - pathway9) may comprise one or more
metabolic pathways, as such pathways exist in nature. In this regard, a reference may be
made, for example, to the Boehringer Mannheim biochemical pathways wall chart. The
specific pathway shown in Fig. 6 can be viewed to identify mechanisms of action, and
toxicology and side effects.

Many biochemical pathways involve a long chain of distinct chemical reactions
catalyzed by distinct enzymes. The first committed step in a biosynthetic pathway is often
regulated by the final product of the pathway through a process called feedback inhibition.
Inhibition of a specific enzyme along a metabolic pathway leads to increased levels of
intermediate chemicals preceding the point of inhibition, and decreased levels of metabolites
following the point of inhibition.

In the composite pathway shown in Fig. 6, a point of inhibition A is shown. Enzymes
in the pathway following the point of inhibition A are repressed, while enzymes in another
direction following the point of inhibition A are induced. When this occurs, a pathway is
inhibited, which prevents the formation of a given final product and removes any feedback
inhibition. Specific enzyme inductions or repressions in response to a disease state, or
application of a drug to the system, can be used to identify those pathways which are affected
by the disease or drug.

For example, as shown in Fig. 6, a drug may be found to decrease serum cholesterol
levels when given to an animal, and that drug may work by an unknown mechanism which is
revealed by the graphically-represented pathways. Since cholesterol biosynthesis occurs
primarily in the liver, the liver can be removed and mRNA can be isolated therefrom. Using
expression profiling techniques, one can determine how this inhibition affects the mRNA
level of thousands of enzymes acting in dozens of pathways. The pathways whose enzyme
levels are significantly affected by drug treatment indicate the affected pathway and likely
suggest a mechanism of drug action.

This is the case for inhibitors of hydroxy-methyl-glutaryl-Coenzyme A (HMG-CoA)
reductase, which catalyzes the first step in cholesterol biosynthesis. This step is shown at the
top of Fig. 6.

Along pathway1 - pathway3, HMG-CoA is converted to long-chain fatty acids by way of
Acetyl-CoA in two reaction steps (not specifically shown in detail in Fig. 6). In another direction,
HMG-CoA is converted to a five carbon isoprenoid via a pathway4 and then to a ten carbon
geranyl via a pathway5. After another pathway6, a 15 carbon farnesyl product is produced.
Another pathway7 produces a 30 carbon squalene, which is then converted to the steroid
lanosterol via pathway8. Then, after pathway9, which comprises a plurality of other reaction
steps, cholesterol is produced.

When the drug (HMG-CoA reductase inhibitor) is applied to the liver, and expression
profiling is performed on the treated liver, the HMG-CoA reductase and enzymes involved
in fatty acid metabolism (which go along the direction of pathway1 - pathway3) are induced,
and the enzymes involved in the formation of cholesterol are repressed.
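
The pattern just described (one group of enzymes induced, another repressed) can be tallied per pathway. The following is a minimal sketch in Python, not part of the described embodiment; the signed fold-change convention, the two-fold threshold default, the function name, and all sample values are assumptions made for illustration only.

```python
from collections import defaultdict


def summarize_pathways(fold_changes, enzyme_pathway, threshold=2.0):
    """Count induced and repressed enzymes per pathway.

    fold_changes: enzyme name -> signed fold change (assumed convention:
                  positive = induced, negative = repressed).
    enzyme_pathway: enzyme name -> pathway label.
    """
    summary = defaultdict(lambda: {"induced": 0, "repressed": 0})
    for enzyme, change in fold_changes.items():
        if abs(change) < threshold:
            continue  # below threshold: not treated as affected
        pathway = enzyme_pathway.get(enzyme)
        if pathway is None:
            continue  # enzyme has no place on the map
        direction = "induced" if change > 0 else "repressed"
        summary[pathway][direction] += 1
    return dict(summary)


# Hypothetical values chosen to mirror the HMG-CoA reductase inhibitor example.
fold_changes = {
    "HMG-CoA reductase": 3.1,
    "fatty acid synthase": 2.4,
    "squalene synthase": -2.8,
    "lanosterol synthase": -2.2,
    "alcohol dehydrogenase": 1.1,
}
enzyme_pathway = {
    "HMG-CoA reductase": "HMG-CoA reduction",
    "fatty acid synthase": "fatty acid metabolism",
    "squalene synthase": "cholesterol formation",
    "lanosterol synthase": "cholesterol formation",
    "alcohol dehydrogenase": "ethanol metabolism",
}
summary = summarize_pathways(fold_changes, enzyme_pathway)
```

Pathways whose counts fall entirely on one side (all induced or all repressed) are the ones that would stand out on the map.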

The identification of pathways of drug metabolism and elimination is done similarly.
Most drugs are metabolized by oxidation to a more reactive species, then conjugation to a
sugar or other molecule that is recognized in the kidney for elimination. The oxidative step is
catalyzed by one or more of over 200 enzymes, including cytochrome P450 enzymes,
followed by conjugation by conjugating enzymes in the liver. These enzymes may be
induced directly by the drug, or because the drug competes with a normal substrate, in which
case less of the normal product is produced by the enzyme pathway, and feedback by that
product is reduced.

Induction of some genes is indicative of toxic effects. A variety of enzymes involved
in drug metabolism are induced in tumor cells (P450 4F1) and the induction by a drug can
indicate that a drug is potentially tumorigenic. In addition, metabolism of a drug may create
toxic metabolites, and may induce peroxidation and proteolytic cascades, which can indicate
that a drug or drug metabolite is causing cell death or damage.

Fig. 7 generally shows, in a block diagram, the structure of the database 26 illustrated
in Fig. 1. Database 26 comprises, among other elements, seven tables as illustrated in Fig. 1,
including table1 (an experiment), table2 (data), table3 (sequence), table4 (BCI link), table5
(BCI number), table6 (map link), and table7 (coordinate).

The experiment table, table1, is populated by expression profiling subsystem 12 at some
point in time. It includes experiment identifiers (ExpID) and associated experiment names
and experiment conditions. Table2 includes the data obtained from the experiment, including
the experiment identification (ExpID), the sequence identification (sequence ID), and the
fold-change of each sequence that has been identified as being affected. Table1 is linked with
table2 by means of the variable ExpID. Table2 holds an associated sequence ID and fold-change
values in association with each ExpID value. The sequence ID value within table2 is
associated with a corresponding indexed sequence ID in table3, which serves as a sequence
table. For each sequence ID, additional variables are associated therewith, including an
accession variable and a description of the sequence.

A BCI link table4 is provided which is linked to table2 and table3 in accordance with
a sequence ID index thereof. BCI link table4 associates with each sequence ID values
including BCI ID, a sequence/link value, and a link score. Each BCI ID has an associated
BCI number (BCI) which is listed in table5. Each BCI ID of table4 and of table5 is linked to
a BCI ID index provided in a map link table6. Each BCI ID has a coordinate ID associated
therewith, which is provided within map link table6. Map link table6 is linked to coordinate
table7 by means of a coordinate ID value. Coordinate table7 provides values associated with
each coordinate ID value, including an x coordinate of the biochemical pathway map, a y
coordinate of the biochemical pathway map, and a biological function associated with the
given location on the map per the corresponding x and y coordinates. The database 26 may
be implemented, in the illustrated embodiment, in accordance with the third normal form of
relational database. It is noted that most of the actual data is stored in table1, table3, table5
and table7, while link tables table2, table4 and table6 are provided primarily to minimize
redundancy in the database.
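
The seven-table layout can be sketched as a relational schema. The example below is illustrative only: it uses Python's built-in sqlite3 module rather than the database of the described embodiment, and every table name, column name, and column type is an assumption derived from the descriptions of table1 through table7 above.

```python
import sqlite3

# Assumed names: experiment=table1, data=table2, sequence=table3,
# bci_link=table4, bci_number=table5, map_link=table6, coordinate=table7.
SCHEMA = """
CREATE TABLE experiment (
    exp_id     INTEGER PRIMARY KEY,
    exp_name   TEXT,
    conditions TEXT
);
CREATE TABLE sequence (
    seq_id      INTEGER PRIMARY KEY,
    accession   TEXT,
    description TEXT
);
CREATE TABLE data (
    exp_id      INTEGER REFERENCES experiment(exp_id),
    seq_id      INTEGER REFERENCES sequence(seq_id),
    fold_change REAL,
    PRIMARY KEY (exp_id, seq_id)
);
CREATE TABLE bci_number (
    bci_id INTEGER PRIMARY KEY,
    bci    TEXT
);
CREATE TABLE bci_link (
    seq_id     INTEGER REFERENCES sequence(seq_id),
    bci_id     INTEGER REFERENCES bci_number(bci_id),
    link_score REAL,
    PRIMARY KEY (seq_id, bci_id)
);
CREATE TABLE coordinate (
    coord_id            INTEGER PRIMARY KEY,
    x                   INTEGER,
    y                   INTEGER,
    biological_function TEXT
);
CREATE TABLE map_link (
    bci_id   INTEGER REFERENCES bci_number(bci_id),
    coord_id INTEGER REFERENCES coordinate(coord_id),
    PRIMARY KEY (bci_id, coord_id)
);
"""


def create_database(path=":memory:"):
    """Create the seven tables and return the open connection."""
    conn = sqlite3.connect(path)
    conn.executescript(SCHEMA)
    return conn


conn = create_database()
table_names = {
    row[0]
    for row in conn.execute("SELECT name FROM sqlite_master WHERE type = 'table'")
}
```

The three link tables (data, bci_link, map_link) carry composite keys only, which is what keeps the descriptive data in the other four tables free of repetition.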

Linking tables table2, table4, and table6 facilitate the many-to-many relationships.
Such exist between experiments and genes: many genes are affected in a given experiment,
and many experiments may be done with each gene. There are also many-to-many
relationships between genes and BCI numbers (e.g., EC numbers). For example, a
multi-functional gene may have many EC numbers, and many similar genes could have the same
EC number. Many-to-many relationships also exist between BCI numbers and mapped
coordinates. For example, if the BCI number comprises an EC number, and the map
comprises or is modeled after the Boehringer Mannheim biochemical pathways wall chart,
one EC number can easily appear more than once within a coordinate or in multiple
coordinates, and each coordinate can have many EC numbers.
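
A many-to-many relationship of this kind can be illustrated with a list of link pairs indexed in both directions. This is a hedged sketch only; the coordinate values are invented, and the function name is hypothetical.

```python
from collections import defaultdict

# Link rows as (EC number, map coordinate) pairs. The coordinates are
# hypothetical and do not correspond to actual wall chart positions.
map_links = [
    ("1.1.1.1", (3, 7)),
    ("1.1.1.1", (5, 2)),
    ("1.1.1.34", (5, 2)),
]


def index_links(pairs):
    """Build both lookup directions from a list of link pairs."""
    forward = defaultdict(set)
    reverse = defaultdict(set)
    for left, right in pairs:
        forward[left].add(right)
        reverse[right].add(left)
    return forward, reverse


coords_by_ec, ecs_by_coord = index_links(map_links)
```

Here one EC number resolves to two coordinates and one coordinate resolves to two EC numbers, which is the situation a link table is meant to handle.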

Fig. 8 is a flowchart illustrating a process of handling data, which is performed by
analyzing system 10 in connection with its use of database 26. In a first step S40, experiment
data is read and stored in table1. Then, in step S42, the act of storing sequence data in table3
is performed. The experiment data stored in table1 includes, among other data, the
experiment ID (ExpID), the experiment name (ExpName), and the conditions of the
experiment. The sequence data stored in table3 includes the sequence ID, the accession
number corresponding to that sequence, and description data concerning the sequence. In
step S44, the fold change per sequence (or per sequence cluster) is determined, and that
information is stored in table2 and related to other data including ExpID and the sequence ID.
In step S46, the BCIs are linked to sequences. Table4 is then used to link the sequences to
the BCI data in table5.

In step S48, the BCIs are linked to map coordinates of the map. Link table6 is
used to link the BCIs to the coordinate data in table7.
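
Steps S40 through S48 can be sketched as a single loading routine over in-memory stand-ins for the tables. This is an illustrative assumption, not the described implementation: the text does not define how fold change is computed, so the signed ratio below, the function names, and the dictionary layout are all hypothetical.

```python
def fold_change(control, treated):
    """Signed fold change (assumed convention): +3.0 means three-fold
    induced, -3.0 means three-fold repressed."""
    if control <= 0 or treated <= 0:
        raise ValueError("expression levels must be positive")
    if treated >= control:
        return treated / control
    return -(control / treated)


def handle_experiment(experiment, sequences, levels, sequence_bcis, bci_coordinates):
    """Populate in-memory stand-ins for the tables in the order of Fig. 8."""
    db = {"table1": {}, "table2": [], "table3": {}, "table4": [], "table6": []}
    # S40: store the experiment record.
    db["table1"][experiment["ExpID"]] = experiment
    # S42: store the sequence records.
    for seq in sequences:
        db["table3"][seq["seq_id"]] = seq
    # S44: determine the fold change per sequence and relate it to ExpID.
    for seq_id, (control, treated) in levels.items():
        db["table2"].append(
            (experiment["ExpID"], seq_id, fold_change(control, treated))
        )
    # S46: link sequences to BCIs.
    db["table4"].extend(sequence_bcis)
    # S48: link BCIs to map coordinates.
    db["table6"].extend(bci_coordinates)
    return db


# Hypothetical input values for illustration.
db = handle_experiment(
    {"ExpID": 1, "ExpName": "demo", "conditions": "hypothetical"},
    [{"seq_id": 10, "accession": "ACC_X", "description": "example sequence"}],
    {10: (50.0, 200.0)},
    [(10, "1.1.1.1")],
    [("1.1.1.1", 7)],
)
```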

Fig. 9 shows another overview display of the map. In this view, a point of inhibition
60 is displayed with a first symbol 60 (which is a square in the illustrated embodiment) at a
specific location within a particular cell of the map corresponding to the point in the pathway
at which the inhibition occurs. Second symbols 62a - 62l represent enzymes which
correspond to sequences affected by the treatment. One color (dark gray in Fig. 9) is used to
represent enzymes which are induced, while another color (white in Fig. 9) represents
enzymes which were repressed. Third symbols 64a and 64b represent end products of the
illustrated pathways. The symbols shown in Fig. 9 are all on a common composite pathway.
End product symbol 64a is shown as dark gray because it is the end product of the pathway
corresponding to the induced enzymes, while end product symbol 64b is shown as white
because it is the end product corresponding to the pathway which is populated by enzymes
which were repressed.

The analysis application 27 may be configured so that various display modes are
provided, including a first display mode in which the contour map view is provided as shown
in Fig. 4, and a second display mode in which respective overview pathways are provided as
shown in Fig. 9. When in the second mode, each composite pathway may be separately
illustrated on its own, or one map may be provided on which the unrelated composite
pathways are all indicated.

A third display mode may be provided in which a detailed view of the map is
provided. This mode may be entered by the user selectively choosing a detailed map at any
desired set of coordinates, by simply clicking on the desired coordinates in an overview
display in either of the first and second display modes.

Fig. 10 is a flow diagram of those steps performed by analysis application 27 to create
the overview display shown in Fig. 9. In a first step S50, the act of determining specific
coordinates of BCIs is performed. In a next step S52, the BCIs are determined which are
common to the same pathway. If there is more than one separate unrelated composite
pathway, a plurality of sets of BCIs are determined and separately categorized. In step S54,
the affected BCIs of a given common pathway are displayed, with one color representing
induced BCIs and another color representing repressed BCIs.

In step S56, subcoordinates of the point of inhibition are determined, if there is a
point of inhibition, i.e., if one side of the common pathway includes all repressed BCIs, while
another side of the common pathway includes all induced BCIs. This point is displayed at
the appropriate location within the biochemical pathway map with a second symbol.

At step S58, the subcoordinates of the end products of the common composite
pathway are determined, and those points are displayed with a third symbol, with one color
representing the end product of a pathway portion corresponding to induced BCIs and
another color representing an end product corresponding to the end of a portion of a path
corresponding to the repressed BCIs.

The point of inhibition may, for example, be determined by identifying the point
along a pathway at which the enzymes switch from one affected state (e.g., induction) to
another state (e.g., repression). The end products may, for example, be presumed by
determining the point along the pathway at which the enzymes are no longer affected, or with
the use of data known about the relevant pathways.
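
The state-switch test for a point of inhibition can be sketched as a scan over the enzymes of a pathway taken in reaction order. This is a minimal illustration under stated assumptions: the list representation, the state labels, and the function name are hypothetical, and a point is reported only when exactly one switch occurs, matching the condition that one side is all induced and the other all repressed.

```python
def find_point_of_inhibition(states):
    """Return the index of the first enzyme after a single state switch.

    states: regulation state of each enzyme in pathway order, given as
            "induced", "repressed", or None for an unaffected enzyme.
    Returns None when there is no switch or more than one switch.
    """
    affected = [(i, s) for i, s in enumerate(states) if s is not None]
    switches = [
        cur_index
        for (_, prev_state), (cur_index, cur_state) in zip(affected, affected[1:])
        if prev_state != cur_state
    ]
    # Exactly one switch means each side of the point holds a single state.
    return switches[0] if len(switches) == 1 else None


clean_switch = find_point_of_inhibition(
    ["induced", "induced", "repressed", "repressed"]
)
with_gap = find_point_of_inhibition(["induced", None, "repressed"])
no_switch = find_point_of_inhibition(["induced", "induced"])
mixed = find_point_of_inhibition(["induced", "repressed", "induced"])
```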

Another, more specific embodiment of the present invention will now be described.
This embodiment is merely an illustrative example.

Initially, a database is created which relates EC (enzyme commission) numbers to
coordinates on the Boehringer Mannheim biochemical pathways wall chart. This database
contains current descriptions for all EC numbers and other information pertaining to the EC
numbers. Descriptions of the EC numbers and other enzyme data are publicly available, and
may be obtained from the website http://www.expasy.ch/txt/enzyme.get. A database may
then be created linking the EC numbers with specific map coordinates corresponding to the
Boehringer Mannheim biochemical pathways wall chart.

Once expression profiling is performed, and experiment data is obtained, EC numbers
are assigned to the sequence clusters obtained in the experiment. This may involve a list of
GenBank accession numbers corresponding to those genes affected more than two-fold
in a set of profiling experiments. GenBank records are available at
http://www.ncbi.nlm.nih.gov/entrez/, and may be parsed for the pattern of numbers in an EC
number (#.#.#.#). For every occurrence of an EC number in the GenBank file, a GenBank
accession number and corresponding EC number may be written to a text file for loading into
a database. The following is a sample GenBank file:
LOCUS       4191746       375 aa                 27-JAN-1999
DEFINITION  alcohol dehydrogenase; ADH.
ACCESSION   4191746
PID         g4191746
DBSOURCE    GENBANK: locus L30113, accession L30113
KEYWORDS    .
SOURCE      baboon.
  ORGANISM  Papio hamadryas
            Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria;
            Primates; Catarrhini; Cercopithecidae; Cercopithecinae; Papio.
REFERENCE   1  (residues 1 to 375)
  AUTHORS   Cheung,B., Holmes,R.S., Easteal,S. and Beacham,I.R.
  TITLE     Evolution of Class I Alcohol Dehydrogenase Genes in Catarrhine
            Primates: Gene Conversion, Substitution Rates, and Gene Regulation
  JOURNAL   Mol. Biol. Evol. 16 (1), 23-36 (1999)
FEATURES             Location/Qualifiers
     source          1..375
                     /organism="Papio hamadryas"
                     /db_xref="taxon:9557"
                     /tissue_type="kidney"
     Protein         1..375
                     /note="ADH"
                     /product="alcohol dehydrogenase"
                     /EC_number="1.1.1.1"
     CDS             1..375
                     /note="putative"
                     /coded_by="L30113:53..1180"
ORIGIN
        1 mstagkvikc kaavlwevkk pfsieeveva ppkahevrik mvavgicrsd dhvvsgtlvt
       61 plpailghea agivegvgeg vttvkpgdkv iplftpqcgk crvcknpesn ycfkndlsnp
      121 rgtmqdgtrr ftcggkpihh flgistfsqy tvvdenavak idaasplekv cligcgfstg
      181 ygpavkvakv tpgstcavfg lggvglsavm gckaagaari iavdinkdkf akakelgate
      241 cinpqdykkp iqevlkemtd ggvdfsfevi grldtimasl lccheacgts vivgvppdsq
      301 nlsinpvlll tgrtwkgaif ggfkskesvp klvsdfmakk fsldalitnv lpfekinegf
      361 dllrsgksir tilmf
//
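
Parsing such a record for the EC number pattern can be sketched with regular expressions. The example is a hedged illustration, not the parser of the described embodiment: it assumes the record keeps the ACCESSION line and the /EC_number qualifier layout shown in the sample, and the function name and the abbreviated record are hypothetical.

```python
import re

# Assumed layout: an ACCESSION line and /EC_number="#.#.#.#" qualifiers.
ACCESSION_RE = re.compile(r"^ACCESSION\s+(\S+)", re.MULTILINE)
EC_RE = re.compile(r'/EC_number="\s*(\d+\.\d+\.\d+\.\d+)\s*"')


def accession_ec_pairs(record):
    """Return (accession, EC number) pairs found in one GenBank record."""
    match = ACCESSION_RE.search(record)
    if match is None:
        return []
    accession = match.group(1)
    return [(accession, ec) for ec in EC_RE.findall(record)]


# Abbreviated copy of the sample record, for illustration.
record = '''LOCUS       4191746       375 aa                 27-JAN-1999
DEFINITION  alcohol dehydrogenase; ADH.
ACCESSION   4191746
FEATURES             Location/Qualifiers
     Protein         1..375
                     /product="alcohol dehydrogenase"
                     /EC_number="1.1.1.1"
'''
pairs = accession_ec_pairs(record)
# One tab-separated line per pair, ready to be written to a text file.
lines = ["\t".join(pair) for pair in pairs]
```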

If no EC number is available in the GenBank file, the nucleotide or amino acid
sequence which corresponds to a particular cluster obtained from the expression profiling
may be obtained from the GenBank file, and a BLAST sequence alignment may be performed
by accessing the publicly available application through
http://www.ncbi.nlm.nih.gov/cgi-bin/BLAST/nph-newblast?Jform=0. The GenBank file may
then be fetched for each sequence that aligns with an expect value (E value, right-most
column in the BLAST results) that is less than 1e-30, and searched for EC numbers.
If an EC number is present, the accession number for the gene
affected in the expression profiling experiment can be recorded, and the expect value from
the sequence alignment may be recorded as well, along with the EC number or numbers
found in the related sequence file or files.
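
The filtering step can be sketched as follows, assuming the alignment results have already been reduced to (accession, expect value) pairs and that EC numbers for the related records have already been extracted. The function name, data shapes, and accession strings are hypothetical; no BLAST client call is shown.

```python
E_VALUE_CUTOFF = 1e-30  # threshold stated in the text


def ec_assignments(query_accession, hits, ec_lookup, cutoff=E_VALUE_CUTOFF):
    """Collect (query accession, EC number, expect value) rows.

    hits: iterable of (hit accession, expect value) from an alignment.
    ec_lookup: hit accession -> list of EC numbers found in its record.
    """
    rows = []
    for hit_accession, e_value in hits:
        if e_value >= cutoff:
            continue  # alignment not significant enough
        for ec in ec_lookup.get(hit_accession, []):
            rows.append((query_accession, ec, e_value))
    return rows


# Hypothetical accessions and values for illustration.
hits = [("HIT_A", 1e-45), ("HIT_B", 1e-12), ("HIT_C", 1e-60)]
ec_lookup = {"HIT_A": ["1.1.1.1"], "HIT_B": ["2.7.1.1"]}
rows = ec_assignments("QUERY_1", hits, ec_lookup)
```

HIT_B is rejected on its expect value and HIT_C contributes nothing because its record carries no EC number.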

At this point, the database can be created, as described previously in this document.
In this regard, in accordance with the specific embodiment now being described, database 26
as shown in Fig. 1 may comprise an ORACLE database, and the host computer may comprise a
Silicon Graphics Origin 2000 computer. These items are merely illustrative, and are not
meant to limit the invention in any way. Other computer systems, databases, and database
structures may be used.

Analysis application 27 may be implemented with use of a Netscape FastTrack
WWW server using standard HTML and Perl. The Perl modules which may be used to
implement this application include (1) DBI/DBD, a database interface for communicating
with a remote pathmap database; (2) CGI, for generating HTML code; (3) PGPLOT, an
interface to compiled PGPLOT Fortran libraries for creating contour plots; (4) GD, a
graphical drawing module for cropping a GIF image produced by PGPLOT and for drawing
polygons and rectangles used for background coloring; (5) MLDBM, a Perl module that
allows creation of a persistent multi-level data structure to implement image map shape data;
and (6) ImageMagick, a module for performing image processing, so that the background
created with GD can be used to create masks, overlays and background coloring.

The application may be configured so that a user can connect to a path map web page 
through the use of browser application 16, select an experiment, and query the database to 
select the wall chart coordinates of genes affected more than two-fold in the experiment. The
genes mapped to each map coordinate are binned, and a contour plot of hits per
coordinate may be displayed, for example, as shown in Fig. 4. Other displays may be
provided, as well, such as those shown in Fig. 9. The user may move the cursor with the use 
of the mouse to the position on the map image to see the biological function corresponding to 
that area of the map, and can click on that particular cell of the map to obtain a more detailed 
view of the pathway information, such as that shown in Fig. 5. In this regard, if the 
Boehringer Mannheim biochemical pathways wall chart structure is used, it is modified to 
illustrate the induced and repressed genes, as well as the EC numbers in association with the 
identified enzymes corresponding to those genes. The enzymes corresponding to affected 
genes are colored based upon whether the gene was repressed or induced. Specifically, the 
enzyme may be represented with magenta text if the corresponding gene cluster was induced, 
cyan if it was repressed, and green if two or more gene clusters with the same EC number 
were affected in opposite directions. The interface provided to the user through browser 
application 16 is displayed on display 20, and may provide a mechanism for allowing the user 
to click on the accession number in order to obtain information on a particular gene and all 
available experiments pertinent to the gene. A mechanism may also be provided to allow 
clicking on a particular EC number to obtain all information relating to that EC number. In 
addition, the analysis system 10 may be provided with a search tool to allow the user to 
submit queries by any given parameter to obtain information related to that parameter. For 
example, the user may query by accession number or gene description to find information for 
a specific gene of interest. 
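
The binning and coloring rules in this passage can be sketched in a few lines. The example is illustrative only and is written in Python rather than the Perl of the described embodiment; the signed fold-change convention, the function names, and the sample rows are assumptions.

```python
from collections import Counter, defaultdict


def hits_per_coordinate(rows, threshold=2.0):
    """Count genes affected more than `threshold`-fold at each coordinate.

    rows: iterable of (accession, coordinate, signed fold change).
    """
    counts = Counter()
    for _accession, coordinate, change in rows:
        if abs(change) > threshold:
            counts[coordinate] += 1
    return counts


def enzyme_colors(rows, threshold=2.0):
    """Pick a text color per EC number from its affected gene clusters.

    rows: iterable of (EC number, signed fold change), one per cluster.
    """
    directions = defaultdict(set)
    for ec, change in rows:
        if abs(change) > threshold:
            directions[ec].add("induced" if change > 0 else "repressed")
    colors = {}
    for ec, seen in directions.items():
        if seen == {"induced", "repressed"}:
            colors[ec] = "green"    # clusters affected in opposite directions
        elif seen == {"induced"}:
            colors[ec] = "magenta"
        else:
            colors[ec] = "cyan"
    return colors


# Hypothetical rows for illustration.
counts = hits_per_coordinate(
    [("ACC_1", (3, 7), 2.5), ("ACC_2", (3, 7), -4.0), ("ACC_3", (5, 2), 1.5)]
)
colors = enzyme_colors(
    [("1.1.1.1", 3.0), ("1.1.1.1", -2.5), ("1.1.1.34", 4.0),
     ("2.7.1.1", -3.0), ("3.1.1.1", 1.2)]
)
```

The per-coordinate counts are the values a contour plot would be drawn from.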

WHAT IS CLAIMED IS:

1. A data processing system for managing and presenting information derived from a
differential expression of genetic information which can be used to model a physiological 
response of biological cells, said system comprising: 

a data-represented map of metabolic pathways, said map having a matrix of 
regions and locations within each said region, said regions being defined by respective 
coordinate sets, each of said metabolic pathways comprising at certain ones of said locations 
respective graphical representations of a reaction, a substrate and a product of the reaction, 
and at least one biological catalyst of the reaction; and 

an association mechanism which links previously generated sets of affected 
sequence data to biological catalyst identifiers each identifying one or more unique biological 
catalysts, and which links said biological catalyst identifiers to said coordinate sets, said sets 
of affected sequence data representing a direction and a magnitude of regulation of each one 
of a number of different nucleic acid sequences. 

2. The data processing system of claim 1 further comprising: 

a grading mechanism which assigns a corresponding grade to a corresponding 
region in accordance with a number of said sets of affected sequence data linked to said 
corresponding region by said association mechanism. 

3. The data processing system of claim 2 further comprising: 

a display mechanism for displaying an overview of said map and a detailed 
view of said map, said overview graphically representing said regions of said map and said 
corresponding grades, and said detailed view comprising detailed graphical representations of 
said metabolic pathways comprising affected biological catalyst indications of locations of 
those of said biological catalysts corresponding to said sets of affected sequence data, each
said affected biological catalyst indication representing a direction of regulation of the set of
affected sequence data corresponding thereto.

4. A method of managing and presenting information derived from a differential
expression of genetic information which can be used to model a physiological response of 
biological cells, said method comprising: 

providing a data-represented map of metabolic pathways, said map having a 
matrix of regions and locations within each said region, said regions being defined by 
respective coordinate sets, each of said metabolic pathways comprising at certain ones of said 
locations respective graphical representations of a reaction, a substrate and a product of the 
reaction, and at least one biological catalyst of the reaction; and 

linking sets of previously generated affected sequence data to biological 
catalyst identifiers each identifying one or more unique biological catalysts, and which links 
said biological catalyst identifiers to said coordinate sets, said sets of affected sequence data 
representing a direction and a magnitude of regulation of each one of a number of different 
nucleic acid sequences. 

5. The method of claim 4 further comprising: 

assigning a corresponding grade to a corresponding region in accordance with 
a number of said sets of affected sequence data linked to said corresponding region by said 
linking. 

6. The method of claim 5 further comprising: 

displaying an overview of said map and a detailed view of said map, said 
overview graphically representing said regions of said map and said corresponding grades, 
and said detailed view comprising detailed graphical representations of said metabolic 
pathways comprising affected biological catalyst indications of locations of those of said 
biological catalysts corresponding to said sets of affected sequence data, each said affected
biological catalyst indication representing a direction of regulation of the set of affected 
sequence data corresponding thereto. 

7. A machine-readable medium having data recorded thereon such that when the data 
is read into a computer and executed, the data causes the computer to: 

use a data-represented map of metabolic pathways, said map having a matrix
of regions and locations within each said region, said regions being defined by respective
coordinate sets, each of said metabolic pathways comprising at certain ones of said locations
respective graphical representations of a reaction, a substrate and a product of the reaction,
and at least one biological catalyst of the reaction; and

link sets of previously generated affected sequence data to biological catalyst 
identifiers each identifying one or more unique biological catalysts, and which links said 
biological catalyst identifiers to said coordinate sets, said sets of affected sequence data 
representing a direction and a magnitude of regulation of each one of a number of different 
nucleic acid sequences. 

8. The machine-readable medium of claim 7, wherein said data further causes the 
computer to: 

assign a corresponding grade to a corresponding region in accordance with a 
number of said sets of affected sequence data linked to said corresponding region by said 
linking. 

9. The machine-readable medium of claim 8, wherein said data further causes the 
computer to: 

display an overview of said map and a detailed view of said map, said overview 
graphically representing said regions of said map and said corresponding grades, and said 
detailed view comprising detailed graphical representations of said metabolic pathways 
comprising affected biological catalyst indications of locations of those of said biological
catalysts corresponding to said sets of affected sequence data, each said affected biological
catalyst indication representing a direction of regulation of the set of affected sequence data 
corresponding thereto.

The person identified below is hereby/has been appointed to act on behalf
of the applicant(s) before the competent International Authorities as: [X] agent  [ ] common representative

Name and address: (Family name followed by given name; for a legal entity, full official
designation. The address must include postal code and name of country.)

KOKULIS, Paul N.
PILLSBURY MADISON & SUTRO, LLP
1100 New York Avenue, NW
Washington, DC 20005
United States of America

Telephone No.: 202 861-3000
Facsimile No.: 202 822-0944
Teleprinter No.:
[ ] Address for correspondence: Mark this check-box where no agent or common representative is/has been appointed and the
space above is used instead to indicate a special address to which correspondence should be sent.

Box No. V DESIGNATION OF STATES

The following designations are hereby made under Rule 4.9(a) (mark the applicable check-boxes; at least one must be marked):

Regional Patent

[X] AP ARIPO Patent: GH Ghana, GM Gambia, KE Kenya, LS Lesotho, MW Malawi, SD Sudan, SL Sierra Leone, SZ Swaziland,
TZ United Republic of Tanzania, UG Uganda, ZW Zimbabwe, and any other State which is a Contracting State of the Harare
Protocol and of the PCT

[X] EA Eurasian Patent: AM Armenia, AZ Azerbaijan, BY Belarus, KG Kyrgyzstan, KZ Kazakhstan, MD Republic of Moldova,
RU Russian Federation, TJ Tajikistan, TM Turkmenistan, and any other State which is a Contracting State of the Eurasian
Patent Convention and of the PCT

[X] EP European Patent: AT Austria, BE Belgium, CH and LI Switzerland and Liechtenstein, CY Cyprus, DE Germany,
DK Denmark, ES Spain, FI Finland, FR France, GB United Kingdom, GR Greece, IE Ireland, IT Italy, LU Luxembourg,
MC Monaco, NL Netherlands, PT Portugal, SE Sweden, and any other State which is a Contracting State of the European
Patent Convention and of the PCT

[X] OA OAPI Patent: BF Burkina Faso, BJ Benin, CF Central African Republic, CG Congo, CI Cote d'Ivoire, CM Cameroon,
GA Gabon, GN Guinea, GW Guinea-Bissau, ML Mali, MR Mauritania, NE Niger, SN Senegal, TD Chad, TG Togo, and any
other State which is a member State of OAPI and a Contracting State of the PCT (if other kind of protection or treatment
desired, specify on dotted line)

National Patent (if other kind of protection or treatment desired, specify on dotted line):

[X] AE United Arab Emirates
[X] AL Albania
[X] AM Armenia
[X] AT Austria
[X] AU Australia
[X] AZ Azerbaijan
[X] BA Bosnia and Herzegovina
[X] BB Barbados
[X] BG Bulgaria
[X] BR Brazil
[X] BY Belarus
[X] CA Canada
[X] CH and LI Switzerland and Liechtenstein
[X] CN China
[X] CR Costa Rica
[X] CU Cuba
[X] CZ Czech Republic
[X] DE Germany
[X] DK Denmark
[X] DM Dominica
[X] EE Estonia
[X] ES Spain
[X] FI Finland
[X] GB United Kingdom
[X] GD Grenada
[X] GE Georgia
[X] GH Ghana
[X] GM Gambia
[X] HR Croatia
[X] HU Hungary
[X] ID Indonesia
[X] IL Israel
[X] IN India
[X] IS Iceland
[X] JP Japan
[X] KE Kenya
[X] KG Kyrgyzstan
[X] KP Democratic People's Republic of Korea
[X] KR Republic of Korea
[X] KZ Kazakhstan
[X] LC Saint Lucia
[X] LK Sri Lanka
[X] LR Liberia
[X] LS Lesotho
[X] LT Lithuania
[X] LU Luxembourg
[X] LV Latvia
[X] MA Morocco
[X] MD Republic of Moldova
[X] MG Madagascar
[X] MK The former Yugoslav Republic of Macedonia
[X] MN Mongolia
[X] MW Malawi
[X] MX Mexico
[X] NO Norway
[X] NZ New Zealand
[X] PL Poland
[X] PT Portugal
[X] RO Romania
[X] RU Russian Federation
[X] SD Sudan
[X] SE Sweden
[X] SG Singapore
[X] SI Slovenia
[X] SK Slovakia
[X] SL Sierra Leone
[X] TJ Tajikistan
[X] TM Turkmenistan
[X] TR Turkey
[X] TT Trinidad and Tobago
[X] TZ United Republic of Tanzania
[X] UA Ukraine
[X] UG Uganda
[X] US United States of America (continuation-in-part)
[X] UZ Uzbekistan
[X] VN Viet Nam
[X] YU Yugoslavia
[X] ZA South Africa
[X] ZW Zimbabwe

Check-boxes reserved for designating States which have become party to the PCT after issuance of this sheet:
[ ]
[ ]

Precautionary Designation Statement: In addition to the designations made above, the applicant also makes under Rule 4.9(b) all 
other designations which would be permitted under the PCT except any designation(s) indicated in the Supplemental Box as being 
excluded from the scope of this statement. The applicant declares that those additional designations are subject to confirmation and 
that any designation which is not confirmed before the expiration of 15 months from the priority date is to be regarded as withdrawn by
the applicant at the expiration of that time limit. (Confirmation (including fees) must reach the receiving Office within the 15-month time limit) 
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Sheet No, 



Supplemental Box If the Supplemental Box is not used, this sheet need not be included in the request. 



1. If, in any of the Boxes, the space is insufficient to furnish all the information: in such case, write "Continuation of Box No. " 
[indicate the number of the Box] and furnish the information in the same manner as required according to the captions of the Box in 
which the space was insufficient, in particular: 

(i) if more than two persons are involved as applicants and/or inventors and no "continuation sheet" is available: in such case, 
write "Continuation of Box No. III" and indicate for each additional person the same type of information as required in Box 
No. III. The country of the address indicated in this Box is the applicant's State (that is, country) of residence if no State of 
residence is indicated below; 

(ii) if, in Box No. II or in any of the sub-boxes of Box No. III, the indication "the States indicated in the Supplemental Box" is 
checked: in such case, write "Continuation of Box No. II" or "Continuation of Box No. III" or "Continuation of Boxes No. II 
and No. III" (as the case may be), indicate the name of the applicant(s) involved and, next to (each) such name, the State(s) 
(and/or, where applicable, ARIPO, Eurasian, European or OAPI patent) for the purposes of which the named person is 
applicant; 

(iii) if, in Box No. II or in any of the sub-boxes of Box No. III, the inventor or the inventor/applicant is not inventor for the 
purposes of all designated States or for the purposes of the United States of America: in such case, write "Continuation of 
Box No. II" or "Continuation of Box No. III" or "Continuation of Boxes No. II and No. III" (as the case may be), indicate the 
name of the inventor(s) and, next to (each) such name, the State(s) (and/or, where applicable, ARIPO, Eurasian, European or 
OAPI patent) for the purposes of which the named person is inventor; 

(iv) if, in addition to the agent(s) indicated in Box IV, there are further agents: in such case, write "Continuation of Box No. IV" 
and indicate for each further agent the same type of information as required in Box No. IV; 

(v) if, in Box No. V, the name of any State (or OAPI) is accompanied by the indication "patent of addition," or "certificate of 
addition," or if, in Box No. V, the name of the United States of America is accompanied by an indication "continuation" or 
"continuation-in-part": in such case, write "Continuation of Box No. V" and the name of each State involved (or OAPI), and 
after the name of each such State (or OAPI), the number of the parent title or parent application and the date of grant of the 
parent title or filing of the parent application; 

(vi) if, in Box No. VI, there are more than three earlier applications whose priority is claimed: in such case, write "Continuation 
of Box No. VI" and indicate for each additional earlier application the same type of information as required in Box No. VI; 

(vii) if, in Box No. VI, the earlier application is an ARIPO application: in such case, write "Continuation of Box No. VI", specify 
the number of the item corresponding to that earlier application and indicate at least one country party to the Paris 
Convention for the Protection of Industrial Property or one Member of the World Trade Organization for which that earlier 
application was filed. 

2. If, with regard to the precautionary designation statement contained in Box No. V, the applicant wishes to exclude any State(s) 
from the scope of that statement: in such case, write "Designation(s) excluded from precautionary designation statement" and 
indicate the name or two-letter code of each State so excluded. 

3. If the applicant claims, in respect of any designated Office, the benefits of provisions of the national law concerning 
non-prejudicial disclosures or exceptions to lack of novelty: in such case, write "Statement concerning non-prejudicial disclosures 
or exceptions to lack of novelty" and furnish that statement below. 



LIPPITT, Raymond F. 
KNIGHT, G. Lloyd 
LOVE, Carl G. 
PERRY, Glenn J. 
MORDUCH, Ruth N. 
WHITE, Paul E., Jr. 
BARUFKA, Jack S. 
SMYRSKI, Steven W. 
GLAZIER, Stephen C. 
NAGY, Paul G. 



BIRD, Donald J. 
ECCLESTON, Lynn E. 
GOWDEY, Peter W. 
LAZAR, Dale S. 
McQUADE, Paul F. 
JOYCE, Kevin E. 
SIRILLA, George M. 
ZAITLEN, Richard H. 
HESS, Adam R. 
ATKINS, William P. 



EDGELL, G. Paul 
JAKOPIN, David A. 
PAULSON, Mark G. 

KLIMA, Timothy J. 
COLTON, Kendrew H. 

DZWONCZYK, Michael R. 
WISE, Roger R. 
BENGTSSON, W. Patrick 
FINKELSTEIN, Jay M. 
SHARER, Paul L. 



All attorneys are partners of the firm of PILLSBURY MADISON & SUTRO, LLP. The address, telephone number and 
facsimile number of all of the above attorneys are as indicated in Box IV. 



Continuation of Box No. V: 

US: 60/121,432; 23 February 1999 (23.02.99) 
Box No. VI PRIORITY CLAIM

Sheet No. . . .

□ Further priority claims are indicated in the Supplemental Box. 



Filing date of earlier application (day/month/year) | Number of earlier application | Where earlier application is: national application: country | regional application:* regional Office | international application: receiving Office

item (1) | 23 February 1999 (23.02.99) | 60/121,432 | US | |

item (2) | | | | |

item (3) | | | | |


| X| The receiving Office is requested to prepare and transmit to the International Bureau a certified copy 
of the earlier application(s) (only if the earlier application was filed with the Office which for the 
purposes of the present international application is the receiving Office) identified above as item(s): (1) 



* Where the earlier application is an ARIPO application, it is mandatory to indicate in the Supplemental Box at least one country party to the Paris Convention for the 
Protection of Industrial Property for which that earlier application was filed (Rule 4.10(b)(ii)). See Supplemental Box. 



Box No. VII INTERNATIONAL SEARCHING AUTHORITY 



Choice of International Searching Authority (ISA) 

(if two or more International Searching Authorities are 
competent to carry out the international search, indicate the 
Authority chosen; the two-letter code may be used): 

ISA/US 



Request to use results of earlier search; reference to that search (if an earlier 
search has been carried out by or requested from the International Searching Authority): 

Date (day/month/year) Number Country (or regional Office) 



Box No. VIII CHECK LIST: LANGUAGE OF FILING 



This international application contains the following number of sheets:

request                                          4
description (excluding sequence listing part)   19
claims                                           3
abstract                                         1
drawings                                        10
sequence listing part of description
Total number of sheets                          37

This international application is accompanied by the item(s) marked below:

1. | X|
2. □
3. □
4. □
5. □
6. □
7. □
8. □
9. | X|



Figure of the drawings which 
should accompany the abstract: 


Language of filing of the 

international application: English 


Box No. IX SIGNATURE OF APPLICANT OR AGENT 



Next to each signature, indicate the name of the person signing and the capacity in which the person signs (if such capacity is not 
obvious from reading the request). 






Paul N. Kokulis 




1. Date of actual receipt of the purported 
international application: 

2. Drawings: 

| | received: 

| | not received: 

3. Corrected date of actual receipt due to later but 
timely received papers or drawings completing the 
purported international application: 

4. Date of timely receipt of the required 
corrections under PCT Article 11(2): 

5. International Searching Authority 
(if two or more are competent): ISA/ 

6. | | Transmittal of search copy delayed 
until search fee is paid. 





Date of receipt of the record copy 
by the International Bureau: 



For International Bureau use only 
This sheet is not part of and does not count as a sheet of the international application. 



PCT 



FEE CALCULATION SHEET 
Annex to the Request 



Applicant's or agent's 
file reference 



5956 PCT 



For receiving Office use only 



International application No. 



Date stamp of the receiving Office 



Applicant 

WARNER-LAMBERT COMPANY 



CALCULATION OF PRESCRIBED FEES 

1. TRANSMITTAL FEE                                            240.00   T

2. SEARCH FEE                                                 700.00   S

International search to be carried out by: US 

(If two or more International Searching Authorities are competent in relation to the international 
application, indicate the name of the Authority which is chosen to carry out the international search.) 

3. INTERNATIONAL FEE 

Basic Fee 

The international application contains 37 sheets. 

first 30 sheets ........                                      427.00   b1

7 remaining sheets x $10.00 additional amount                  70.00   b2

Add amounts entered at b1 and b2 and enter total at B         497.00   B

Designation Fees 

The international application contains ALL designations. 

8 (number of designation fees payable, maximum 8) x 92.00 (amount of designation fee)   736.00   D

Add amounts entered at B and D and enter total at I         1,233.00   I

(Applicants from certain States are entitled to a reduction of 75% of the 
international fee. Where the applicant is (or all applicants are) so entitled, the 

4. FEE FOR PRIORITY DOCUMENT (if applicable)                   15.00   P

5. TOTAL FEES PAYABLE                                       2,188.00   TOTAL

Add amounts entered at T, S, I and P, and enter total in the TOTAL box 

□ The designation fees are not paid at this time. 
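The arithmetic on this sheet can be checked with a short sketch. The function and parameter names below are hypothetical and are not part of the form; the amounts are the ones entered on the sheet, and the 30-sheet threshold and maximum of 8 payable designation fees are taken from the sheet's own captions.

```python
from decimal import Decimal


def pct_fee_total(transmittal, search, basic_first_30, sheets,
                  per_extra_sheet, designation_fee, designations_payable,
                  priority_document):
    """Recompute the boxes of the fee calculation sheet (names are illustrative)."""
    extra_sheets = max(sheets - 30, 0)            # sheets beyond the first 30
    b2 = extra_sheets * per_extra_sheet           # box b2: additional amount
    b = basic_first_30 + b2                       # box B: basic fee
    d = designations_payable * designation_fee    # box D: designation fees
    i = b + d                                     # box I: international fee
    total = transmittal + search + i + priority_document  # T + S + I + P
    return {"b2": b2, "B": b, "D": d, "I": i, "TOTAL": total}


fees = pct_fee_total(
    transmittal=Decimal("240.00"),       # box T
    search=Decimal("700.00"),            # box S
    basic_first_30=Decimal("427.00"),    # box b1
    sheets=37,
    per_extra_sheet=Decimal("10.00"),
    designation_fee=Decimal("92.00"),
    designations_payable=8,              # capped at 8 on the form
    priority_document=Decimal("15.00"),  # box P
)
print(fees["TOTAL"])
```

With the values entered on the sheet, the intermediate boxes come to 70.00 (b2), 497.00 (B), 736.00 (D) and 1,233.00 (I), which agrees with the total of 2,188.00.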



MODE OF PAYMENT 

authorization to charge 
deposit account (see below) 



| X| cheque 
□ postal money order 



| | bank draft 
| | cash 

□ 

revenue stamps 



| | coupons 

| | other (specify): 



DEPOSIT ACCOUNT AUTHORIZATION (this mode of payment may not be available at all receiving Offices) 
The RO/US | | is hereby authorized to charge the total fees indicated above to my deposit account. 

(this check-box may be marked only if the conditions for deposit accounts of the receiving Office so permit) is 
hereby authorized to charge any deficiency or credit any overpayment in the total fees indicated above to my 
deposit account. 

□ is hereby authorized to charge the fee for preparation and transmittal of the priority document to the International 
Bureau of WIPO to my deposit account. 



03-3975 



22 February 2000 




Deposit Account No. 



Date (day/month/year) 



Signature 



Paul N. Kokulis 
PCT/US00/04338 

PATENT COOPERATION TREATY 



From the INTERNATIONAL BUREAU 



PCT 


To: 


NOTIFICATION CONCERNING 
SUBMISSION OR TRANSMITTAL 
OF PRIORITY DOCUMENT 

(PCT Administrative Instructions, Section 411) 


KOKULIS Paul, N. 
Pillsbury Madison & Sutro LLP 
1100 New York Avenue, N.W. 
Washington, DC 20005 
ETATS-UNIS D'AMERIQUE 


Date of mailing (day/month/year) 

27 April 2000 (27.04.00) 




Applicant's or agent's file reference 
5956 PCT 


IMPORTANT NOTIFICATION 


International application No. 
PCT/US00/04338 


International filing date (day/month/year) 

22 February 2000 (22.02.00) 


International publication date (day/month/year) 

Not yet published 


Priority date (day/month/year) 

23 February 1999 (23.02.99) 


Applicant 




WARNER-LAMBERT COMPANY et al 




1. The applicant is hereby notified of the date of receipt (except where the letters "NR" appear in the right-hand column) by the 
International Bureau of the priority document(s) relating to the earlier application(s) indicated below. Unless otherwise 
indicated by an asterisk appearing next to a date of receipt, or by the letters "NR", in the right-hand column, the priority 
document concerned was submitted or transmitted to the International Bureau in compliance with Rule 17.1(a) or (b). 


2. This updates and replaces any previously issued notification concerning submission or transmittal of priority documents. 


3. An asterisk (*) appearing next to a date of receipt, in the right-hand column, denotes a priority document submitted 
or transmitted to the International Bureau but not in compliance with Rule 17.1(a) or (b). In such a case, the attention 
of the applicant is directed to Rule 17.1(c) which provides that no designated Office may disregard the priority claim 
concerned before giving the applicant an opportunity, upon entry into the national phase, to furnish the priority document 
within a time limit which is reasonable under the circumstances. 


4. The letters "NR" appearing in the right-hand column denote a priority document which was not received by the International 
Bureau or which the applicant did not request the receiving Office to prepare and transmit to the International Bureau, 
as provided by Rule 17.1(a) or (b), respectively. In such a case, the attention of the applicant is directed to Rule 17.1(c) which 
provides that no designated Office may disregard the priority claim concerned before giving the applicant an opportunity, 
upon entry into the national phase, to furnish the priority document within a time limit which is reasonable under the 
circumstances. 


Priority date | Priority application No. | Country or regional Office or PCT receiving Office | Date of receipt of priority document

23 February 1999 (23.02.99) | 60/121,432 | US | 17 April 2000 (17.04.00)


The International Bureau of WIPO 
34, chemin des Colombettes 
1211 Geneva 20, Switzerland 

Facsimile No. (41-22) 740.14.35 


Authorized officer 

Marc Salzman 

Telephone No. (41 -22) 338.83.38 


Form PCT/IB/304 (July 1998) 


003249987 



PATENT COOPERATION TREATY 

PCT 

INTERNATIONAL SEARCH REPORT 
(PCT Article 18 and Rules 43 and 44) 



Applicant's or agent's file reference 
5956 PCT 


FOR FURTHER ACTION see Notification of Transmittal of International Search Report 
(Form PCT/ISA/220) as well as, where applicable, item 5 below. 


International application No. 
PCT/US00/04338 


International filing date (day/month/year) 
25 FEBRUARY 2000 


(Earliest) Priority Date (day/month/year) 
23 FEBRUARY 1999 


Applicant 

WARNER-LAMBERT COMPANY 



This international search report has been prepared by this International Searching Authority and is transmitted to the applicant 
according to Article 18. A copy is being transmitted to the International Bureau. 

This international search report consists of a total of sheets. 

| X| It is also accompanied by a copy of each prior art document cited in this report. 



1. Basis of the report 

a. With regard to the language, the international search was carried out on the basis of the international application in the 

language in which it was filed, unless otherwise indicated under this item. 

□ the international search was carried out on the basis of a translation of the international application furnished to this 
Authority (Rule 23.1(b)). 

b. With regard to any nucleotide and/or amino acid sequence disclosed in the international application, the international search 

was carried out on the basis of the sequence listing: 

| | contained in the international application in written form. 

I | filed together with the international application in computer readable form. 

| | furnished subsequently to this Authority in written form. 

| | furnished subsequently to this Authority in computer readable form. 

| | the statement that the subsequently furnished written sequence listing does not go beyond the disclosure in the 
international application as filed has been furnished. 

| | the statement that the information recorded in computer readable form is identical to the written sequence listing has been 
furnished. 

2. | | Certain claims were found unsearchable (See Box I). 

3. | | Unity of invention is lacking (See Box II). 

4. With regard to the title, 

| X| the text is approved as submitted by the applicant 

| | the text has been established by this Authority to read as follows: 



5. With regard to the abstract, 

| X| the text is approved as submitted by the applicant 

| | the text has been established, according to Rule 38.2(b), by this Authority as it appears in 
Box III. The applicant may, within one month from the date of mailing of this international 
search report, submit comments to this Authority. 

6. The figure of the drawings to be published with the abstract is Figure No. 

| | as suggested by the applicant 
| | because the applicant failed to suggest a figure. 
| | because this figure better characterizes the invention. 



| X| None of the figures. 



Form PCT/ISA/210 (first sheet) (July 1998)* 




PATENT COOPERATION TREATY 



From the INTERNATIONAL SEARCHING AUTHORITY 



To: PAUL N. KOKULIS 

PILLSBURY MADISON & SUTRO, LLP 
1100 NEW YORK AVENUE, NW 
WASHINGTON, D.C. 20005 


PCT 

NOTIFICATION OF TRANSMITTAL OF 
THE INTERNATIONAL SEARCH REPORT 
OR THE DECLARATION 

(PCT Rule 44.1) 


Date of mailing (day/month/year) 29 JUN 2000 


Applicant's or agent's file reference 
5956 PCT 


FOR FURTHER ACTION See paragraphs 1 and 4 below 


International application No. 
PCT/US00/04338 


International filing date 
(day/month/year) 

25 FEBRUARY 2000 


Applicant 
WARNER-LAMBERT COMPANY 



The applicant is hereby notified that the international search report has been established and is transmitted herewith. 
Filing of amendments and statement under Article 19: 

The applicant is entitled, if he so wishes, to amend the claims of the international application (see Rule 46): 

When? The time limit for filing such amendments is normally 2 months from the date of transmittal of the 
international search report; however, for more details, see the notes on the accompanying sheet 

Where? Directly to the International Bureau of WIPO 
34, chemin des Colombettes 
1211 Geneva 20, Switzerland 
Facsimile No.: (41-22)740.14.35 

For more detailed instructions, see the notes on the accompanying sheet 



2. □ The applicant is hereby notified that no international search report will be established and that the declaration under 
Article 17(2)(a) to that effect is transmitted herewith. 



3. | | With regard to the protest against payment of (an) additional fee(s) under Rule 40.2, the applicant is notified that 

□ the protest together with the decision thereon has been transmitted to the International Bureau together with the 
applicant's request to forward the texts of both the protest and the decision thereon to the designated Offices. 




□ no decision has been made yet on the protest; the applicant will be notified as soon as a decision is made. 



4. Further action(s): The applicant is reminded of the following: 

Shortly after 18 months from the priority date, the international application will be published by the International Bureau. If 
the applicant wishes to avoid or postpone publication, a notice of withdrawal of the international application, or of the 
priority claim, must reach the International Bureau as provided in Rules 90bis.1 and 90bis.3, respectively, before the 
completion of the technical preparations for international publication. 

Within 19 months from the priority date, a demand for international preliminary examination must be filed if the applicant 
wishes to postpone the entry into the national phase until 30 months from the priority date (in some Offices even later). 

Within 20 months from the priority date, the applicant must perform the prescribed acts for entry into the national phase before 
all designated Offices which have not been elected in the demand or in a later election within 19 months from the priority 
date or could not be elected because they are not bound by Chapter II. 
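The time limits in item 4 are all counted in months from the priority date. The sketch below shows one way to tabulate them for the priority date given in this file, 23 February 1999. The helper name `add_months` is hypothetical, and clamping to the last day of a shorter month is an assumption of this sketch; it does not adjust for non-working days or for the later limits some Offices allow.

```python
import calendar
from datetime import date


def add_months(start: date, months: int) -> date:
    """Return the date `months` calendar months after `start`.

    If the target month has no day with the same number, the last day
    of that month is used (an assumption of this sketch).
    """
    total = start.month - 1 + months
    year = start.year + total // 12
    month = total % 12 + 1
    day = min(start.day, calendar.monthrange(year, month)[1])
    return date(year, month, day)


priority_date = date(1999, 2, 23)

# Events named in item 4 of the notification, keyed by months from priority.
time_limits = {
    18: "international publication (shortly after)",
    19: "demand for international preliminary examination",
    20: "national phase entry where no demand or election was made",
    30: "national phase entry where a demand was filed in time",
}

for months, event in time_limits.items():
    print(months, add_months(priority_date, months).isoformat(), event)
```

For this application the 18-, 19-, 20- and 30-month points fall on 23 August 2000, 23 September 2000, 23 October 2000 and 23 August 2001 respectively.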





Name and mailing address of the ISA/US 

Commissioner of Patents and Trademarks 
Box PCT 

Washington, D.C. 20231 
Facsimile No. (703) 305-3230 


Authorized officer 
MARY K ZEMAN 

Telephone No. (703) 308-0196 


Form PCT/ISA/220 (July 1998)* 


(See notes on accompanying sheet) 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/US00/04338 



A. CLASSIFICATION OF SUBJECT MATTER 

IPC(7) :G01N 31/00; G06F 19/00; 

US CL :702/20, 19, 27; 435/6 
According to International Patent Classification (IPC) or to both national classification and IPC 

B. FIELDS SEARCHED 

Minimum documentation searched (classification system followed by classification symbols) 

U.S. : 702/20, 19, 27; 435/6 



Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 



Electronic data base consulted during the international search (name of data base and, where practicable, search terms used) 

EAST, WEST, Derwent, MEDLINE, CAPlus, Biotechno (STN) 
expression profiling, map, sequence, computer/processor 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category* | Citation of document, with indication, where appropriate, of the relevant passages | Relevant to claim No.

X | BASSETT JR., D.E. et al. Gene Expression Informatics- It's all in Your Mine. Nature Genetics. 01 January 1999, Volume 21 (Supplement), pages 51-55, see entire document. | 1-9

Y | DEBOUCK, C. et al. DNA Microarrays in Drug Discovery and Development. Nature Genetics. 01 January 1999, Vol. 21 (Supplement), pages 48-50, see entire document. | 1-9

Y | IYER, V.R. et al. The Transcriptional Program in the Response of Human Fibroblasts to Serum. Science. 01 January 1999, Vol. 283, pages 83-87, see entire document. | 1-9



| X| Further documents are listed in the continuation of Box C. | | See patent family annex 



* Special categories of cited documents: 

"A" document defining the general state of the art which is not considered 
to be of particular relevance 

"E" earlier document published on or after the international filing date 

"L" document which may throw doubts on priority claim(s) or which is 
cited to establish the publication date of another citation or other 
special reason (as specified) 

"O" document referring to an oral disclosure, use, exhibition or other 

"P" document published prior to the international filing date but later than 
the priority date claimed 

"T" later document published after the international filing date or priority 
date and not in conflict with the application but cited to understand 
the principle or theory underlying the invention 

"X" document of particular relevance; the claimed invention cannot be 
considered novel or cannot be considered to involve an inventive step 
when the document is taken alone 

"Y" document of particular relevance; the claimed invention cannot be 
considered to involve an inventive step when the document is 
combined with one or more other such documents, such combination 
being obvious to a person skilled in the art 

"&" document member of the same patent family 
Applicant 
WARNER-LAMBERT COMPANY 



1. This international preliminary examination report has been prepared by this International Preliminary 
Examining Authority and is transmitted to the applicant according to Article 36. 

2. This REPORT consists of a total of sheets, including this cover sheet. 

| | This report is also accompanied by ANNEXES, i.e., sheets of the description, claims and/or drawings 
which have been amended and are the basis for this report and/or sheets containing rectifications made 
before this Authority (see Rule 70.16 and Section 607 of the Administrative Instructions under the PCT). 

These annexes consist of a total of sheets. 



3. This report contains indications relating to the following items: 

Basis of the report 
Priority 

Non-establishment of report with regard to novelty, inventive step and industrial applicability 



Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial 
applicability; citations and explanations supporting such statement 
Name and mailing address of the IPEA/US 
Commissioner of Patents and Trademarks 
Washington, D.C. 20231 
Authorized officer 

I. Basis of the report 



1. With regard to the elements of the international application:* 
| X| the international application as originally filed. 



| X| the description: 

pages 1-19 , as originally filed 

pages NONE , filed with the demand 

pages NONE , filed with the letter of 

the claims: 

pages 20-22 , as originally filed 

pages NONE , as amended (together with any statement) under Article 19 

pages NONE , filed with the demand 

pages NONE , filed with the letter of 



| X| the drawings: 

pages 1-10 , as originally filed 

pages NONE , filed with the demand 

pages NONE , filed with the letter of 

| | the sequence listing part of the description: 

pages NONE , as originally filed 

pages NONE , filed with the demand 

pages NONE , filed with the letter of 



2. With regard to the language, all the elements marked above were available or furnished to this Authority in the 
language in which the international application was filed, unless otherwise indicated under this item. 
These elements were available or furnished to this Authority in the following language which is: 

| | the language of a translation furnished for the purposes of international search (under Rule 23.1(b)). 

| | the language of publication of the international application (under Rule 48.3(b)). 

| | the language of the translation furnished for the purposes of international preliminary examination (under Rules 
55.2 and/or 55.3). 

3. With regard to any nucleotide and/or amino acid sequence disclosed in the international application, the 
international preliminary examination was carried out on the basis of the sequence listing: 

I I contained in the international application in printed form. 

I | filed together with the international application in computer readable form. 

I | furnished subsequently to this Authority in written form. 

I | furnished subsequently to this Authority in computer readable form. 

| | The statement that the subsequently furnished written sequence listing does not go beyond the disclosure in the 
international application as filed has been furnished. 

I I The statement that the information recorded in computer readable form is identical to the written sequence listing 
has been furnished. 

4. | X| The amendments have resulted in the cancellation of 

| | the description, pages NONE 
| | the claims, Nos. NONE 
| | the drawings, sheets/fig NONE 

5. | | This report has been established as if (some of) the amendments had not been made, since they have been considered to go 

beyond the disclosure as filed, as indicated in the Supplemental Box (Rule 70.2(c)).** 
* Replacement sheets which have been furnished to the receiving Office in response to an invitation under Article 14 are referred to in 
this report as "originally filed" and are not annexed to this report since they do not contain amendments (Rules 70.16 and 70.17). 
** Any replacement sheet containing such amendments must be referred to under item 1 and annexed to this report. 
V. Reasoned statement under Rule 66.2(a)(ii) with regard to novelty, inventive step or industrial applicability; 
citations and explanations supporting such statement 

1. STATEMENT 

Novelty (N)                      Claims NONE    YES
                                 Claims 1-9     NO

Inventive Step (IS)              Claims NONE    YES
                                 Claims 1-9     NO

Industrial Applicability (IA)    Claims 1-9     YES
                                 Claims NONE    NO



2. CITATIONS AND EXPLANATIONS 
References cited in Search Report: 

BASSETT JR. et al. Gene Expression Informatics- it's all in your Mine. 01 January 1999, Nature Genetics, Vol. 21, Supplement, 
pages 51-55. 

DEBOUCK et al. DNA microarray in drug discovery and development. 01 January 1999, Nature Genetics, Vol. 21, Supplement, 
pages 48-50. 

IYER, V.R. et al. The Transcriptional Program in the Response of Human Fibroblasts to Serum. 01 January 1999, Science, Vol. 
283, pages 83-87. 

Claims 1-9 lack novelty under PCT Article 33(2) as being anticipated by SCHILLING et al. This document discloses biochemical 
pathway data managing systems. These systems allow for various types of map displays illustrating the linkages between sequences, 
genes, enzymes, activities, etc. Information from gene expression profiles, once validated and assigned a function, can also be 
included. Substrate, catalyst and product can all be included. 

Claims 1-9 lack novelty under PCT Article 33(2) as being anticipated by WO 96/29659 A1 (KUREHA). This document discloses 
data management and processing systems which link and associate members of biosynthetic or biochemical pathways. The data can 
describe substrates, catalysts, cofactors, enzymes, products, and/or proteins. The data can be arranged in varying formats, depending 
on the intended results or chosen display means. The Figures set forth varying processing flow charts which illustrate the disclosed 
methods. 

Claims 1-9 lack an inventive step under PCT Article 33(3) as being obvious over BASSETT Jr. et al. in view of DEBOUCK et al., 
IYER et al. and KAWAMOTO et al. BASSETT Jr. discloses methods of obtaining gene expression data for thousands of genes 
under various conditions, and also discloses database systems and datamining approaches for organizing that data into biologically 
relevant pathways. BASSETT Jr. discloses various sources for database management and analysis software. DEBOUCK et al., 
IYER et al. and KAWAMOTO et al. each disclose particular situations wherein microarray gene expression profiling data was used 
to identify members in various biochemical or cellular pathways. DEBOUCK et al. discuss strategies for identifying genes related to 
diseases, how to set up a model system, and how to identify genes affected by a drug treatment. IYER et al. disclose experiments 
wherein the gene expression profile of serum-exposed fibroblasts is compared to the profile of normal cells. The data is integrated by 
computer, and a variety of gene clusters were identified and grouped together. KAWAMOTO et al. disclose the expression profiles 
of active genes in normal liver cells, and the use of that data in monitoring liver response to drug therapy, etc. As the art discloses 
data processing systems for processing biologically related data into biological pathways or responses and specific methods of 
implementing such systems, the claims lack an inventive step over the cited art. 



NEW CITATIONS 



SCHILLING, C.H. et al. The Underlying Pathway Structure of Biochemical Reaction Networks. PNAS USA. 01 April 1998, Vol. 95, 
pages 4193-4198, see entire document. 

WO 96/29659 A1 (KUREHA K.K.K.K.) 26 September 1996, see entire document. 
VI. Certain documents cited 



1. Certain published documents (Rule 70.10) 

Application No / Patent No. | Publication Date (day/month/year) | Filing Date (day/month/year) | Priority date (valid claim) (day/month/year)

US 5,980,096 A | 09 November 1999 (09.11.1999) | 17 January 1995 (17.01.1995) | None
US 6,183,959 B1 | 06 February 2001 (06.02.2001) | 08 July 1998 (08.07.1998) | None
US 6,199,017 B1 | 06 March 2001 (06.03.2001) | 16 September 1997 (16.09.1997) | None



2. Non-written disclosures (Rule 70.9) 

Kind of non-written disclosure | Date of non-written disclosure (day/month/year) | Date of written disclosure referring to non-written disclosure (day/month/year)
VIII. Certain observations on the international application 



The following observations on the clarity of the claims, description, and drawings or on the questions whether the claims 
are fully supported by the description, are made: 

The description is objected to under PCT Rule 66.2(a)(v) as lacking clarity under PCT Article 5 because it fails to adequately enable 
practice of the claimed invention because: the disclosure does not set forth how the association mechanism, or grading mechanism 
are to be implemented. The disclosure does not set forth how the data is to be sifted through, in order to find valid linkages or 
associations between any two members, or even groups of members. The disclosure does not set forth how to manipulate the data, 
or how to manipulate the software in order to display certain correlations of the data. Each of these functions requires some sort of 
algorithmic action, and at the least requires types of known algorithms, equation, specific flow chart, code or commands for carrying 
out the particular claimed functions. Without such description, one of skill in the art would not be able to implement the system of 
the application. 

Claims 1-9 are objected to as lacking clarity under PCT Rule 66.2(a)(v) because practice of the claimed invention is not enabled as 
required under PCT Rule 5.1(a) for the reasons set forth in the immediately preceding paragraph. 